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Abstract 

Background: Improving our understanding of processes at the core of cellular lifestyles can be aided by 
combining information from genetic analyses, high-throughput experiments and computational predictions. 

Results: We combined data and predictions derived from phenotypic, physiological, genetic and computational 
analyses to dissect the metabolic and energetic networks of the facultative photosynthetic bacterium Rhodobacter 
sphaeroides. We focused our analysis on pathways crucial to the production and recycling of pyridine nucleotides 
during aerobic respiratory and anaerobic photosynthetic growth in the presence of an organic electron donor. In 
particular, we assessed the requirement for NADH/NADPH transhydrogenase enzyme, PntAB during respiratory and 
photosynthetic growth. Using high-throughput phenotype microarrays (PMs), we found that PntAB is essential for 
photosynthetic growth in the presence of many organic electron donors, particularly those predicted to require its 
activity to produce NADPH. Utilizing the genome-scale metabolic model iRsp1095, we predicted alternative routes 
of NADPH synthesis and used gene expression analyses to show that transcripts from a subset of the 
corresponding genes were conditionally increased in a ApntAB mutant. We then used a combination of metabolic 
flux predictions and mutational analysis to identify flux redistribution patterns utilized in the ApntAB mutant to 
compensate for the loss of this enzyme. Data generated from metabolic and phenotypic analyses of wild type and 
mutant cells were used to develop iRspl 140, an expanded genome-scale metabolic reconstruction for R. 
sphaeroides with improved ability to analyze and predict pathways associated with photosynthesis and other 
metabolic processes. 

Conclusions: These analyses increased our understanding of key aspects of the photosynthetic lifestyle, highlighting 
the added importance of NADPH production under these conditions. It also led to a significant improvement in the 
predictive capabilities of a metabolic model for the different energetic lifestyles of a facultative organism. 

Keywords: Photosynthesis, Transhydrogenase, Constraint-based analysis, Metabolic modeling, Phenotype microarray, 
Rhodobacter sphaeroides 



Background 

Information about an organisms capabilities can be 
derived from a variety of sources. When genomic infor- 
mation is combined with biochemical, phenotypic or 
genetic data, functional roles and interrelationships of 
components within metabolic or regulatory networks be- 
come better defined [1-5]. Thus, to obtain a global view 
of an organisms capabilities, it is often beneficial to 
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develop models that integrate data from different types 
of experiments. In obtaining such integrated views, 
genome-scale metabolic network models can serve both 
as databases for storage and organization and as tools 
for the combination and analysis of heterogeneous data 
sets [6]. A particular interest of our laboratory is deve- 
loping an integrated understanding of metabolic net- 
works in photosynthetic microbes, because of their 
abundance in nature, the unique aspects of a solar- 
driven lifestyle and their growing importance in biotech- 
nological applications [7-9]. 

We study purple non-sulfur bacteria, a group of 
photosynthetic microbes that display great metabolic 
and energetic diversity [10]. The purple non-sulfur 
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bacterium Rhodobacter sphaeroides represents one of the 
best studied photosynthetic organisms, and has been used 
to develop models of photon capture, light-driven energy 
metabolism and other aspects of its diverse lifestyles 
[11,12]. This facultative microbe is capable of anoxygenic 
photosynthetic growth, aerobic respiration and anaerobic 
respiration [11,12]. Furthermore, R. sphaeroides has been 
studied for potential biotechnological applications inclu- 
ding the ability to produce H 2 [13-15] and ubiquinone 
[16], production of polyhydroxybutyrate, which can be 
used as a source of biodegradable plastics [17], reme- 
diation of radioactive contamination [18], and its ability to 
fix C0 2 and N 2 [7,19,20]. The available genetic, genomic 
and physiological tools [12] also make R. sphaeroides an 
excellent system in which to improve our understanding 
of solar energy capture, metabolic and energetic aspects of 
photosynthesis and other energetic pathways, and the net- 
works which regulate processes of societal and biotechno- 
logical interest. To obtain an integrated understanding of 
photosynthesis or other aspects of R. sphaeroides lifestyles 
requires the use of high-throughput data to develop better 
predictive models of its metabolic network. 

In this work, we take a systematic approach to expand 
our knowledge of the metabolic and energetic networks of 
R. sphaeroides by combining data from genetic, pheno- 
typic and transcriptional analyses with constraint-based 
modeling. We use high-through phenotypic microarrays 
to show that wild type R. sphaeroides grows on a diverse 
array of substrates and that this nutrient utilization profile 
varies significantly between photosynthetic and non- 
photosynthetic growth conditions. Using the conserved 
bioenergetic enzyme pyridine nucleotide transhydrogenase 
(PntAB) as an example, we identify carbon sources where 
recycling of pyridine nucleotides by this enzyme is essen- 
tial for photosynthetic or non-photosynthetic growth. We 
use a genome-scale metabolic model to predict flux distri- 
butions and identify alternative NADPH producing reac- 
tions that can compensate for the loss of PntAB and 
thereby explain the conditional growth of ApntAB cells on 



selected carbon sources. Transcriptional and phenotypic 
analyses of defined single and double mutants were used 
to verify the potential use of some of these alternative 
NADPH producing reactions under defined conditions. 
The new data derived from analyzing the growth of wild 
type and mutant cells were used to develop iRspll40, a 
significant update to the existing genome-scale recon- 
struction of the R. sphaeroides metabolic network [11], 
with increased coverage of metabolic pathways and im- 
proved predictive ability. iRspll40 accounts for 1140 
genes, 878 metabolites and 1416 reactions. This work 
illustrates the new insights into important cellular pro- 
cesses that can be acquired by integrating data from ge- 
netic, genomic and other complementary experiments 
into predictive models of biological systems. 

Results and discussion 

Global analysis of substrate utilization by /?. sphaeroides 

One important step in acquiring a global understanding 
of cellular processes in an organism is to develop a 
broad perspective of its metabolic repertoire. An assess- 
ment of the literature reveals that R. sphaeroides has 
been reported to grow on 27 carbon, 3 nitrogen, 1 phos- 
phorus and 4 sulfur sources [11]. In contrast, the exis- 
ting genome-scale model of R. sphaeroides metabolic 
network, iRspl095, predicted an ability to grow on a sig- 
nificantly larger number of substrates (Table 1) [11]. 
Thus, to improve our knowledge of the metabolic cap- 
abilities of R. sphaeroides, and aid subsequent analyses 
of genetic or physiological perturbations, we used phe- 
notype microarrays (PM) [21,22] to assess the ability of 
wild type (WT) cells to utilize carbon sources under 
anoxygenic photoheterotrophic conditions (using a sin- 
gle carbon source as an external electron donor; this is 
hereafter referred to as photosynthetic growth), aerobic 
and anaerobic respiratory conditions (see Methods). We 
also determined the suite of, nitrogen (N), phosphorus 
(P) and sulfur (S) sources that do or do not support 
photosynthetic growth. 



Table 1 Substrate utilization profile of R. sphaeroides under different growth conditions 


Nutrient source 


Previously known* 


Predicted by iRsp1095** 


Photo 


Based on PM assay 3 

Aero Anaerobic 15 


Carbon 


27 


64 


61 (190) 


68(190) 16(41) 


Nitrogen 


3 


31 


66 (95) 




Phosphorus 


1 


6 


42 (59) 




Sulfur 


4 


4 


18 (35) 




Total 


35 


105 


187 (379) 





*Previously known growth substrates under photosynthetic conditions. 

^Includes 28 false positive predictions and predictions for 20 substrates not included on the various Biolog plates. The simulations for substrate utilization in 
iRsp1095 were done under aerobic and photosynthetic conditions. 

a The values in parenthesis represent the total number of carbon, nitrogen, phosphorus or sulfur sources tested using PM and related approaches. 

b Photo - photosynthetic growth; Aero - aerobic respiratory growth; Anaerobic - anaerobic respiratory growth using DMSO as the terminal electron acceptor. 
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R. sphaeroides utilizes different arrays of nutrients across 
growth conditions 

The results from analysis of substrate utilization by WT R. 
sphaeroides cells (Table 1, Additional file 1: Tables S1-S4), 
significantly expands the array of compounds that support 
growth of this organism. While the carbon utilization 
profiles were largely similar during photosynthetic and 
aerobic respiratory growth, several important differences 
were observed. Eight carbon sources appeared to support 
growth photosynthetically but not aerobically, while 15 
supported growth aerobically but not photosynthetically 
(Additional file 2: Figure S1A, Additional file 1: Table SI). 
Potential causes for the observed differences might 
include: (i) longer lag times under individual conditions 
(Additional file 2: Figure SIB), which may result in an 
apparent inability to utilize the carbon source under one 
experimental condition; (ii) insurmountable metabolic, 
bioenergetic or regulatory bottlenecks (Additional file 2: 
Figure SIC); or (iii) potential differences between the data 
derived from the photosynthetic PM assay (which mea- 
sures an increase in optical density) and the aerobic PM 
assay (that measures respiration) [23]. 

Of the 53 carbon sources that were used both photosyn- 
thetically and aerobically, 41 were tested for their ability to 
support growth under anaerobic respiratory conditions 
using dimethyl sulfoxide (DMSO) as the terminal electron 
acceptor (Additional file 1: Table SI). Only 16 of these 
carbon sources were capable of supporting anaerobic 
respiratory growth (as measured by an increase in optical 
density) after 10 days of incubation. We propose that the 
inability of WT R. sphaeroides to grow in the presence of 
several carbon substrates during anaerobic respiratory 
growth is likely due to regulatory and/or bioenergetic 
constraints, as pathways required for their catabolism are 
either known or predicted to be present in the genome. 

PM assays also revealed that 66 nitrogen, 42 phos- 
phorus and 18 sulfur sources supported growth photo- 
synthetically in WT R. sphaeroides (Table 1, Additional 
file 1: Tables S2-S4). This is a number of nitrogen, phos- 
phorous and sulfur substrates which is similar to those 
shown to support growth of other well-studied faculta- 
tive bacteria like Escherichia coli [24] and Bacillus 
subtilis [25]. 

The ability of R. sphaeroides to grow on a wide variety 
of carbon, nitrogen, phosphorus and sulfur sources 
(Additional file 1: Table S8) is a further demonstration 
of its metabolic versatility. Of particular interest for 
future studies is the pattern of substrate utilization 
observed under different growth conditions, which we 
propose likely reflects regulatory differences, since en- 
zymes needed to carry out the required reactions are 
predicted to be encoded in the genome. Below we show 
that these PM analyses of WT cells provide important 
reference points for studying the effects of mutations on 



the metabolic, energetic and regulatory pathways that 
are potentially used during various modes of growth. 

Analyzing the role of PntAB under defined growth 
conditions 

To illustrate how knowledge of the substrate utilization 
profile of WT R. sphaeroides can be used to assess the 
effects of genetic perturbation on the metabolic network, 
we describe insights gained from analyzing an important 
and widely conserved energetic enzyme, pyridine nu- 
cleotide transhydrogenase (PntAB). PntAB is a hetero- 
tetrameric membrane-bound enzyme consisting of a and 
(3 subunits that catalyzes the reversible exchange of re- 
ducing equivalents between pyridine nucleotides based 
on the magnitude of the proton gradient across the cyto- 
plasmic membrane [26,27]. 



H 



periplasm 



+ NADP+ + NADH^NADPH + NAD^ 

' cytosol 



Thus PntAB plays a major role in maintaining the ba- 
lance of cellular pyridine nucleotides (NADH/NADPH). 
NADPH is a source of reducing equivalents in a large 
number of crucial anabolic pathways such as the Calvin 
cycle in autotrophic cells, fatty acid biosynthesis and tet- 
rapyrrole or pigment biosynthesis in photosynthetic or- 
ganisms [28]. 

Extensive studies have shown that E. coli PntAB ex- 
pression is induced when there is a demand for NADPH 
[29]. In addition, E. coli PntAB is required for optimal 
growth on carbon sources whose metabolism does not 
directly generate NADPH, such as glycerol [29]. E. coli 
also possesses an energy independent soluble trans- 
hydrogenase, UdhA, which is induced when there is an 
excess of NADPH (e.g., growth on acetate) and mediates 
conversion of NADPH to NADP + [29,30]. In addition to 
PntAB and UdhA, glucose-6-phosphate dehydrogenase 
(Zwf ) and isocitrate dehydrogenase (led) can help main- 
tain bacterial NADPH pools under specific conditions 
[29,30]. We compared growth of R. sphaeroides wild 
type and ApntAB cells (PntAl [31]) using PMs to iden- 
tify conditions with an increased need for NADPH in 
this bacterium. 

PntAB is conditionally essential for photosynthetic and 
aerobic respiratory growth 

Under photosynthetic conditions, only 25 carbon sources 
supported growth of PntAl compared to 61 substrates 
that were used by WT cells. Importantly, only PntAl cells 
using D -glucose achieved a final optical density that was 
equivalent to that of its parent, while PntAl cells using D- 
aspartate grew well, whereas the WT parent did not grow 
on this substrate (Table 2, Additional file 1: Table S5), 
suggesting these were the only tested carbon sources that 
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Table 2 Summary of carbon utilization in WT and PntAI 
cells under aerobic conditions 



Total number of carbon sources utilized 


Photo 3 


Aero 


PntAI 


25 


51 


WT 


61 


73 


Differences in carbon sources utilized 


Equivalent growth in PntAI and WT 


1 


40 


PntAI only 


1 


1 


Increased growth in PntAI 


0 


1 


Reduced growth in PntAI 


23 


9 


Growth in WT only 


37 


23 



a Photo - photosynthetic growth; Aero - aerobic respiratory growth. 

supported normal photosynthetic growth in PntAI. To 
independently confirm the differences observed in the 
PM assays, we compared photosynthetic growth between 
PntAI, its WT parent and a PntAI cells expressing PntAB 
from an IPTG inducible plasmid (Figure 1). The com- 
bined results of these analyses can place the carbon 
sources that support photosynthetic growth in R. sphae- 
roides into 3 groups (Figure 1A-C, Additional file 1: Table 
S7). Group I carbon sources such as D-glucose, result in 



net production of NADPH during their metabolism, 
via enzymes like the glucose-6-phosphate dehydrogen- 
ase. These Group I carbon sources support comparable 
photosynthetic growth in WT and PntAI cells (Fig- 
ure 1A, Additional file 2: Figure S2). Group II carbon 
sources (such as acetate) are incapable of supporting 
photosynthetic growth in the absence of PntAB. Me- 
tabolism of these Group II carbon sources require net 
consumption of NADPH, in addition to that required 
for anabolic processes (Figure IB, Additional file 2: 
Figure S2). Growth of PntAI using Group III carbon 
sources (such as succinate and many others, Add- 
itional file 1: Table S7), was significantly impaired 
compared to that of WT cells, and exhibited a long lag 
before growth commenced (Figure 1C). It should be 
noted however, that several Group III carbon sources 
such as succinate, failed to support photosynthetic 
growth on agar plates (Additional file 2: Figure S2), re- 
inforcing the need for PntAB activity when using these 
carbon sources. 

Combined, these data suggest that PntAB is the major 
source of NADPH for photosynthetic growth on carbon 
sources except glucose and D-aspartate. Furthermore, 



Glucose 




CD 
C 



CD 
C 
<D 
CD 
X 

Q_ 

a 
< 



100 150 
Time (hrs) 

Figure 1 Categorization of growth substrates based on NADPH demand and requirement for PntAB under photosynthetic conditions. 

(A) R. sphaeroides growth on glucose via the Entner-Doudoroff pathway includes the NADPH-generating glucose-6-phosphate (G6P) 
dehydrogenase enzyme (Zwf), which supports normal photosynthetic growth even in the absence of PntAB. (B) R. sphaeroides growth on acetate 
is predicted to occur via the ethylmalonyl-CoA (EM-CoA) pathway, involving oxidation of 2 molecules of NADPH per 3 molecules of acetyl-CoA 
(A-CoA) consumed [59]. Note that a third molecule of A-CoA used up in later steps of the EM-CoA pathway is not depicted in this illustration. 
(C) During photosynthetic growth on substrates that do not directly produce NADPH (e.g. succinate), PntAB is required. In some cases, after a 
long lag period, cells adapt and grow photosynthetically albeit at a slower growth rate (1 7.2 ± 0.56 hrs compared to 7.53 ± 0.56 hrs for wild type 
cells). Abbreviations: AA-CoA - acetoacetyl-CoA; 3HB-CoA - 3-hydroxy butyryl-CoA; Cr-CoA - crotonyl-CoA; 6PGL - phosphoglucono-6-lactone. 
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the fact that PntAl, but not its WT parent grows on 
D-aspartate suggests either that this strain contains un- 
linked mutations that allow it to metabolize this substrate 
or that the metabolism of the substrate is induced when 
the NADPH pool is reduced. This observation could also 
indicate that R. sphaeroides possesses an NADPH-linked 
pathway for aspartate catabolism [32,33], even though its 
genome does not contain an open reading frame with sig- 
nificant amino acid sequence similarity to known enzymes 
with such an activity. In the case of Group III carbon 
sources (succinate and many others), other NADPH 
producing pathways could be activated to support growth 
albeit at a slower rate. For example, the delayed photosyn- 
thetic growth that is seen with some Group III carbon 
sources might be the result of metabolic or genetic alte- 
rations that are needed to provide PntAB-independent 
routes for NADPH synthesis (see below). In addition, we 
predict that for Group II substrates (such as acetate), the 
date predict that the combined NADPH demand for me- 
tabolism and anabolic processes is too high to be provided 
by such alternative pathways. 

In contrast, PM assays show that under aerobic condi- 
tions PntAl grows similarly to its WT parent on the 
many of carbon sources assessed (Figure 2, Table 2, 
Additional file 1: Table S6). Of the 73 carbon sources 
tested, 40 allowed essentially equivalent growth (as mea- 
sured by the respiratory output of the PM assays) bet- 
ween PntAl and its WT parent. In addition, 23 carbon 
sources that allowed aerobic growth of the WT parent 
did not support aerobic respiration/growth of PntAl 



after 96 hrs of incubation, while another 9 showed 
significantly reduced aerobic respiration/growth in cells 
lacking PntAB. Both L- and D-aspartate, supported 
improved aerobic respiration/growth in PntAl cells 
compared to its WT parent (Figure 2). These results un- 
derscore potential differences in the relative need for 
PntAB activity under distinct metabolic states and 
provide a further indication that an, as yet unidentified 
NADPH-linked pathway for metabolism of aspartate and 
potentially other carbon sources exists in R. sphaeroides. 

Alternative NADPH-generating pathways can be utilized 
under different growth conditions 

For Group II and III carbon sources (such as acetate and 
succinate respectively), which require PntAB for photo- 
synthetic growth, growth of PntAl was indistinguishable 
from its WT parent under aerobic respiratory conditions 
(Figure 2, Additional file 1: Table S6, Additional file 2: 
Figure S2). One explanation for this observation is that 
alternative NADPH-generating reactions can function to 
replace PntAB under aerobic respiratory conditions. In- 
deed, iRspl095, predicts 6 other routes that could gene- 
rate NADPH under defined conditions in R. sphaeroides 
(Table 3). The relative expression of these candidate 
alternative NADPH-generating pathways was assayed 
using quantitative reverse transcriptase PCR (qRT-PCR) 
to determine if transcript levels for the respective genes 
are increased in the absence of PntAB when compared 
to a parent strain, when cells were grown by aerobic res- 
piration in cultures containing one of the 3 groups of 
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Figure 2 Growth data from Biolog PM1 and PM2A for WT and PntAl under aerobic respiratory conditions. The plots compare the 
respiration rates across 190 carbon sources between wild type and PntAl cells. Regions of the graph in red indicate better growth by WT cells 
than PntAl, while portions in green indicate better growth in PntAl. Regions in yellow represent the overlap between the kinetic data from the 
2 strains. 
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Table 3 Reactions predicted by iRsp1095 to be involved in NADPH generation 


Gene identifier 


Enzyme name 


Reaction catalyzed* 


EC number 


RSP_0239 & RSP_0240 


PntAB 


NADP + + NADH + 2 H + [p] = > NADPH + NAD + + 2 H + 


1.6.1.2 


RSPJ939 


NADPH-ferredoxin reductase 


Reduced ferredoxin + NADP + + H + <= > Oxidized ferredoxin + NADPH 


1.18.1.2 


RSP_2734 


Zwf 


D-Glucose 6-phosphate + NADP + < = > 6PGL + NADPH 


1.1.1.49 


RSPJ217 


Malic enzyme 


(S)-Malate + NADP + = > Pyruvate + C0 2 + NADPH 


1.1.1.40 


RSPJ593 








RSP_0661 


THF dehydrogenase 


mlthf + NADP + < = > methf + NADPH 


1.5.1.5 


RSP_0446 


led 


Isocitrate + NADP + < = > 2-Oxoglutarate + C0 2 + NADPH 


1.1.1.42 


RSPJ559 








RSP_1 146 & RSP_1 149 


Glutamate synthase 


2 L-Glutamate + NADP + < = > L-Glutamine + 2-Oxoglutarate + NADPH + H + 


1.4.1.13 



*6PGL - D-Glucono-1,5-lactone 6-phosphate; mlthf - 5,10-Methylenetetrahydrofolate; methf - 5,10-Methenyltetrahydrofolate; THF dehydrogenase - 
5,1 0-methylene-tetrahydrofolate dehydrogenase. 



carbon sources defined above (i.e., glucose, acetate and 
succinate) (Figure 3A-H). This analysis revealed carbon 
source-dependent differential expression of potential al- 
ternative NADPH generation pathways. During aerobic 
respiratory growth on a carbon source that requires 
NADPH consumption (acetate), the loss of PntAB was 
accompanied by an ~8 fold increase in expression of a 
putative NADPH:ferredoxin reductase gene (RSP_1939). 
When using glucose, a carbon source that generates net 
NADPH, we found that zw/(RSP_2734, glucoses-phos- 
phate dehydrogeanse) transcript levels are increased >5 
fold during aerobic respiratory growth on glucose com- 
pared to acetate or succinate, in both the WT parental 
strain and PntAl. Furthermore, transcripts for genes en- 
coding putative malic enzyme (RSP_1217), isocitrate de- 
hydrogenase (RSP_0446) and methylenetetrahydrofolate 



dehydrogenase (RSP_0661) enzymes were each in- 
creased >2 fold in PntAl during aerobic respiratory 
growth compared to its WT parent (Figure 3A-H). We 
recognize that changes in gene expression may not ne- 
cessarily result in equivalent changes in flux through the 
corresponding reactions in the direction of NADPH 
synthesis (see below). However, these data illustrate the 
potential of alternative NADPH-generating pathways 
during aerobic respiratory growth to contribute to the 
growth of PntAl when using many carbon sources. 

Interestingly, none of the transcript levels for the tested 
genes were significantly increased in the succinate-grown 
PntAl cells under aerobic respiratory conditions. This 
could indicate that the contribution of PntAB is minor 
under this condition and that alternative NADPH- 
generating pathways provide sufficient NADPH to support 



RSP_0240 (pntA) 




Succinate Glucose Acetate 



RSPJ939 




Succinate Glucose 



RSP_2734 (zwf) 




Succinate Glucose Acetate 



»3 



RSP_0446 (icd) 
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Figure 3 Transcript levels of genes encoding putative NADPH-generating enzymes in R. sphaeroides under aerobic respiratory 
conditions. Transcript levels for genes encoding enzymes predicted by iRspl 095 to be involved in NADPH generation (Table 3), were assayed via 
qRT-PCR in the presence or absence of PntAB. (A) RSP_0240 (pntA) (B) RSPJ939 (NADPH-ferredoxin reductase) (C) RSP_2734 (zwf) (D) RSP_0446 
(icd) (E) RSPJ217 (Malic enzyme) (F) RSP_0661 (5,10-methylene-tetrahydrofolate dehydrogenase) (G) RSPJ146 (Glutamate synthase) (H) 
RSP_1 593 (Malic enzyme). Transcript levels in WT cells is shown in blue bars, while those for PntAl is in red bars. All fold change values represent 
expression relative to WT succinate-grown cells, whose relative expression was set to 1. ** Significantly increased expression in PntAl relative 
to WT. 
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growth. Alternatively, other as yet unidentified NADPH- 
generating pathways or post-transcriptional events might 
exist in both wild type and ApntAB cells. 

The metabolic flux predictions from iRspl095 predict 
that gluconeogenesis is one potential route which could 
be utilized during growth on succinate in the absence of 



PntAB, to produce glucose-6-phosphate, which would 
then be metabolized via the Entner-Doudoroff pathway 
to produce NADPH (Figure 4). To test this hypothesis, 
we experimentally analyzed the impact of loss of Zwf 
under different growth and nutrient conditions. We 
found that loss of Zwf alone (strain Zwfl) did not 
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(See figure on previous page.) 

Figure 4 Predicted flux distributions during aerobic respiratory growth with succinate. Map of R. sphaeroides central carbon metabolism 
showing flux distributions predicted by iRspl 095 during aerobic growth with succinate. All fluxes are in mmol/gDW h. The fluxes in black are 
those predicted for the wild type cells, while those in red are predictions made for the hpntAB deletion strain. Reactions with no flux values are 
predicted to have a zero flux. To improve accuracy of predictions, fluxes were constrained using publicly available genome-wide expression data 
from wild type cells grown aerobically on succinate (see Methods). Green arrows indicate the predicted NADPH generating reactions under these 
conditions. The entry point of some other carbon sources utilized by R. sphaeroides are also shown (blue boxes). It should be noted that the 
predicted fluxes shown represent only one of many optimal solutions from the FBA solution space. Also note that R. sphaeroides does not 
possess a homolog of 6-Phosphogluconate dehydrogenase, which links the pentose phosphate pathway (PPP) with the Entner-Doudoroff (ED) 
pathway in some other organisms. TCA - tricarboxylic acid; G6P - Glucose 6-phosphate; F6P - Fructose 6-Phosphate; F1,6P - Fructose 1, 
6-bisphosphate; G3P - Glyceraldehyde 3-phosphate; DHAP - Dihydroxyacetone phosphate; 6PGL - phosphoglucono-6-lactone; 6PG - 6- 
Phosphogluconate; KDPG - 2-Keto-3-deoxy-6-phosphogluconate; PEP - Phosphoenolpyruvate; Pyr - Pyruvate; A-CoA - Acetyl-CoA; CIT - Citrate; 
ISOCIT - Isocitrate; 2-OG - 2-oxoglutarate; SUCC-CoA - Succinyl CoA; SUCC - Succinate; FUM - Fumarate; MAL - Malate; OAA - Oxaloacetate; 
AA-CoA - acetoacetyl-CoA; 3HB-CoA - 3-hydroxy butyryl-CoA; Cr-CoA - crotonyl-CoA; EM-CoA - Ethylmalonyl-CoA; MM-CoA - (3-methylmalyl-CoA; 
PROP-CoA - Propionyl-CoA; PHB - Polyhydroxybutyrate. 



significantly impair aerobic respiratory growth on suc- 
cinate, consistent with the prediction that this enzyme is 
not required for growth under these conditions. How- 
ever, loss of Zwf in PntAl (PntAl-Zwfl) resulted in a 
significant decrease in aerobic respiratory growth on 
succinate compared to the PntAl mutant (Figure 5A), 
suggesting that Zwf makes a significant contribution to 
NADPH production, but only in the absence of PntAB, 
even though zwf transcript levels are not increased in 



PntAl cells under this condition. Additionally, the ob- 
served growth difference between the Zwfl and PntAl- 
Zwfl mutants indicates that PntAB makes even a more 
significant contribution to NADPH production when 
cells are grown aerobically with succinate as a carbon 
source (Figure 5A). These data confirm the models pre- 
diction and suggest that Zwf is a major NADPH gene- 
rating enzyme utilized by the cell in the absence of 
PntAB during aerobic respiratory growth on succinate. 
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Under photosynthetic conditions, the PntAl-Zwfl 
double mutant was incapable of growth on either glu- 
cose, succinate or acetate (representatives of each of the 
3 major classes of carbon source we defined previously). 
However, growth is partially restored for glucose and 
succinate when the PntA-Zwfl mutant is complemented 
with a plasmid containing the zwf gene (Figure 5B,D). 
These findings indicate that Zwf also serves as a major 
route for NADPH generation during photosynthetic 
growth with these carbon sources, an observation that 
was predicted by metabolic flux analysis in iRspl095 
(Additional file 2: Figures S3 and S4). Additionally, 
growth of Zwfl cells is impaired when using glucose 
either photosynthetically or aerobically (Figure 5C,D, 
Additional file 2: Figure S2), suggesting that the Entner- 
Doudoroff pathway is the major glycolytic pathway in 
R. sphaeroides, despite significant expression of genes 
encoding enzymes of the Embden-Meyerhof-Parnas 
pathway in these cultures. This conclusion is also con- 
sistent with both experimental analysis of 13 C-glucose 
metabolism in R. sphaeroides under aerobic respiratory 
conditions [34] and with the metabolic flux predictions 
made by iRspl095 (Additional file 2: Figure S4). 

The conditional ability of alternative pathways to 
compensate for the loss of PntAB could reflect a higher 
demand for NADPH during photosynthetic or other 
energetic conditions. For example, under photosynthetic 
conditions, R. sphaeroides produces significantly larger 
amounts of fatty acids, photopigments and other com- 
ponents of the photosynthetic apparatus [35] that are 
each synthesized via NADPH-dependent pathways. 
Indeed, iRspl095 predicts there is a 2-4 fold increase 
in the demand for NADPH under photosynthetic 
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conditions in cells using succinate or glucose as a major 
carbon source (Figure 6). If this prediction is accurate, 
our data suggests that this significantly greater demand 
for NADPH during photosynthetic growth cannot be 
met by using alternative NADPH-generating pathways 
that are sufficient under aerobic respiratory conditions. 
However, iRspl095 also predicts that the need for 
NADPH during aerobic respiratory growth in the pres- 
ence of acetate is greater than that required for photosyn- 
thetic growth on either succinate or glucose, indicating 
that the cell might have the metabolic capacity to generate 
sufficient amount NADPH to support photosynthetic 
growth via these alternative pathways. Thus, additional 
studies are needed to determine if there are additional 
constraints under photosynthetic conditions which make 
alternative NADPH-generating pathways that function 
under aerobic respiratory conditions insufficient to sup- 
port growth under other energetic states. Combined, these 
data illustrate the centrality, and previously unrecognized 
importance, of the routes for NADPH production in the 
photosynthetic lifestyle of R. sphaeroides. Given the ubi- 
quitous nature of PntAB across biology, it is possible that 
this enzyme plays a major energetic role in other photo- 
synthetic and non-photosynthetic organisms. 

iRsp1140: a revised experimentally-validated genome- 
scale metabolic reconstruction for /?. sphaeroides 

The above results provide new information about the 
metabolic capabilities of R. sphaeroides that can be uti- 
lized to refine biochemically, genetically and genomically 
structured databases employed in constraint-based ana- 
lysis [36]. We previously constructed and validated a 
genome-scale metabolic reconstruction for R. sphae- 
roides, called iRspl095, using its annotated genome, 
published organism-specific data and information from 
continuous cultures of WT cells [11]. 

However, the substrate utilization predictions of 
iRspl095 could not explain a large amount of the data 
obtained during PM analysis of wild type and mutant 
cells (Figure 7A). Consequently, we performed a 2-step 
refinement and extension of iRspl095 that involved 
addition and removal of appropriate reactions, metabo- 
lites and genes to increase its agreement with the PM 
data, while also incorporating newly available metabolic 
data for R. sphaeroides since release of iRspl095 (see 
Methods). An initial refinement process resulted in 
addition of 280 reactions, 81 metabolites and 27 genes 
to iRspl095, while removing 23 reactions and 3 genes 
(Additional file 3: Tables Sl-3). Ninety-six of these 280 
additional reactions consisted of newly added enzymatic 
activities, for which no apparent genes were present in 
the annotated R. sphaeroides genome. To identify candi- 
date enzymes for these reactions, we conducted 
new BLAST analyses utilizing protein sequence of 
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Figure 7 R. sphaeroides substrate utilization and transport. A detailed comparison between the predicted substrate utilization in iRspl 095 
(A) and iRspl 140 (B) with data obtained from PM analysis. G-G - Growth (predicted by model) and Growth (observed in PM); G-NG - Growth 
(predicted by model) and No Growth (observed in PM); NG-G - No Growth (predicted by model) and Growth (observed in PM); NG-NG - No 
Growth (predicted by model) and No Growth (observed in PM). Percentage represents agreement between predictions and PM data from 
photosynthetic analysis only. 



experimentally verified enzymes from other organisms. 
These searches resulted in the identification and anno- 
tation of 6 new candidate enzymes. An additional 1 1 en- 
zymes already included in iRspl095, with previously 
predicted functions, were identified as candidates to 
catalyze new reactions in the model (Additional file 3: 
Tables S4). 

Of the 280 reactions added to iRspl095, 92 are pre- 
dicted membrane transport reactions. Organisms utilize 
a variety of transporters to import nutrients [37,38]. 
However, R. sphaeroides appears to favor the use of 
ATP-dependent ABC transporters (-7.84% of its genome 
is dedicated to ABC transport functions compared to 
4.9% in E. coli [39]). An analysis of the R. sphaeroides 
genome reveals that of the genes encoding ABC trans- 
porters with no known or predicted functions, 11 are or- 
ganized in operons encoding proteins with an ATPase 
domain, an integral membrane permease domain and a 
substrate-specific substrate binding domain (typical of 

Table 4 Predicted R. sphaeroides ABC transporter operons 
tested for substrate specificity using Biolog PM 

Operons Identified substrate 3 
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importers [40]) (Table 4). To test the function of these 
putative transport operons, we deleted the gene for each 
of these 11 predicted substrate binding proteins and 
assessed growth of the resulting mutants using PM. This 
analysis, plus subsequent liquid growth studies led to 
identification of substrate-specific growth defects in mu- 
tants lacking individual genes for candidate transporters 
(Additional file 2: Figure S6). Based on this data set 
we have provisionally identified substrates for 5 pre- 
viously uncharacterized transporters (Table 4). These 
transport reactions were incorporated into iRspl095 by 
the addition of 15 genes, 2 reactions and one metabolite 
(Additional file 3: Tables S4). 

These updates resulted in a refined metabolic model 
for R. sphaeroides, iRspl 140, consisting of 1416 reac- 
tions, 878 metabolites and accounting for 1140 genes 
within its genome (other properties of iRspl 140 are 

Table 5 Comparison of the properties of iRspl 095 and 
iRspl 140 



a ND - Not Determined. Specific substrates for these transporters could not be 
determined from PM assay and the analysis of mutants lacking single genes in 
each of these predicted operons (see text). 
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summarized in Table 5). Overall, iRspll40 includes a 
larger number of reactions, metabolites and genes, and 
it is supported by increased physiological, genetic and 
genomic evidence for the inclusion of many components 
in the network. The predictions of iRspll40 are in better 
agreement with the PM data compared with those of 
iRspl095 (Figure 7), verifying the improved predictive 
ability of iRspll40. In addition, predictions made by 
iRspll40 for growth rate or production of H 2 , poly- 
hydroxybutyrate and C0 2 evolution were in excellent 
agreement with published data [11] (Additional file 3: 
Table S5). We expect that iRspll40 will enable im- 
proved modeling of metabolic behavior under a larger 
number of conditions. In addition, the increased number 
of genes in iRspll40 provides a larger set of targets for 
assessing the effects of genetic perturbations on growth 
or metabolic activity under both photosynthetic and 
non-photosynthetic conditions. 

Conclusions 

While the development of high-throughput or global ap- 
proaches can provide large amounts of data, the task of 
extracting meaningful biological insight from this infor- 
mation is still challenging [4]. To gain new biological in- 
sights, constraint-based and other modeling approaches 
can be used to integrate various data sets [1-3,41]. 

In this study, we took an integrated approach to gain 
new insight into the metabolic, energetic and photo- 
synthetic lifestyles of R. sphaeroides. We extended the 
number of nutrients that can support growth of WT cells. 
We also showed that a conserved bioenergetic enzyme 
(PntAB) which can provide reduced pyridine nucleotides 
is essential for photosynthetic growth on many carbon 
sources. We used a genome-scale model for R. sphaeroides 
to make flux predictions, as well as generate and test hy- 
potheses on alterative NADPH-generating pathways that 
allow growth in the absence of PntAB. The products of 
various anabolic pathways require NADPH that is derived 
mainly from PntAB activity, so exploiting these and other 
alternative NADPH generating pathways we identified, 
could improve growth and metabolic end products de- 
rived from photosynthetic and non-photosynthetic wild 
type cells or those engineered to produce compounds of 
biotechnological interest. 

Previous studies have shown the utility of high- 
throughput data sets in refining and validating genome- 
scale metabolic models [42-44], We used our data to 
produce a 2 nd generation genome-scale reconstruction 
for R. sphaeroides, iRspll40 with significantly improved 
coverage of metabolic functions and predictions that are 
in better agreement with experimental data. iRspll40, 
provides an improved depiction of the R. sphaeroides 
metabolic network, so it will be useful in studying 



photosynthesis, as well as a wider range of metabolic 
processes in this and related organisms. 

Methods 

Bacterial strains and growth conditions 

R. sphaeroides strains 2.4.1 and Ga were used as pa- 
rental strains. All mutants were made in strain 2.4.1 
except PntAl [31] and Zwfl, which were constructed 
in Ga, and PntAl-Zwfl, which was constructed in 
PntAl (Additional file 4: Table SI). E. coli DH5a was 
used as a plasmid host, and E. coli SI 7-1 was used to 
conjugate DNA into R. sphaeroides, 

R. sphaeroides cultures were incubated at 30°C in 
Sistroms Minimal Medium (SMM) [45] lacking glutamate 
and aspartate, with succinate (33.9 mM), or an alternative 
sole carbon source. The molar concentration of carbon 
atoms of the carbon source was kept constant at 
135.5 mM. Photosynthetic cultures were incubated in 
screw capped tubes at a light intensity of -10 W/m 2 , while 
anaerobic respiratory cultures were incubated in screw 
capped tubes in the dark with the media supplemented 
with 0.9% DMSO. Aerobic cultures were shaken in flasks. 
Optical density of photosynthetic cells in screw capped 
tubes was measured using a Klett-Summerson photometer 
and is expressed in Klett units (1 Klett unit equals 
approximately 10 7 cells/mL). Other optical density mea- 
surements were made by measuring optical density at 600 
or 650 nm in a spectrophotometer. When required, the 
media was supplemented with 100 \iM IPTG, 25 (ig/mL 
kanamycin or 25 [xg/mL spectinomycin. E. coli cells were 
grown in Luria Bertani medium at 37°C, supplemented 
with 25 (ig/mL kanamycin where needed. 

Phenotype microarray analysis 

To determine the substrate utilization profiles of R. 
sphaeroides, phenotype microarrays (PMs) were used 
with a few modifications. To assay aerobic respiratory 
growth on different carbon sources (Biolog PM1 and 
PM2A), cells were grown on SMM agar plates aerobi- 
cally for 3 days. Cells were swabbed from the agar plates 
and suspended in 4 mL inoculation fluid (Biolog inc.) to 
an OD 600 of 0.38. Two mL of this mixture was placed in 
10 mL of inoculation fluid (IF) containing 24 \iL of 
tetrazolium-based dye A (Biolog inc.), resulting in a final 
OD 600 of -0.07. Then 1.2 \iL of vitamin solution 
(1 g nicotinic acid, 0.5 g thiamine-HCl and 0.01 g biotin 
in 100 mL of water) was added and 100 \iL was dis- 
pensed into each well of a 96 well plate. Cultures were 
incubated at 30 °C for 72 to 96 hrs in an OmniLog plate 
reader (Biolog inc). 

To assay photosynthetic growth, 10 mM NaHC0 3 , 
0.4 mM sodium thioglycolate and 1 \iM methylene green 
were added to the IF and this was kept in an anaerobic 
chamber for ~7 days with periodic shaking to facilitate it 
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becoming anaerobic (the methylene green in the IF turns 
colorless once oxygen is removed). R. sphaeroides cells 
were grown photosynthetically on SMM agar plates and 
PM plates set up as described above, except that the 
tetrazolium dye was omitted, as thioglycolate reduces 
the tetrazolium-based dye turning it purple independent 
of cellular respiration. The PM plates were put in an 
anaerobic chamber, inoculated, placed in heat sealed an- 
aerobic bags (Biolog inc.) [46] and incubated under con- 
stant illumination (light intensity of -10 W/m 2 ) at 30 °C 
for 72 to 96 hrs, after which OD 650 readings were taken. 
Anaerobic indicator strips and ageless oxygen absorbers 
(MITSUBISHI Gas Chemical America, Inc.) were placed 
in the sealed bags to report on and maintain an anaer- 
obic environment throughout the experiment. 

Any growth in the negative control well (Al) was 
subtracted from the measured optical density for both 
aerobic and photosynthetic PM. A threshold OD 650 of 
0.05 (after background correction) was used as a base- 
line for scoring photosynthetic growth, as this was the 
highest value obtained from any well in control expe- 
riments where cells were kept in the dark. A threshold 
of 5 Omnilog units (after background correction) was 
used as a baseline for aerobic respiratory growth. Only 
carbon sources that resulted in reproducible growth 
above the baseline across all replicates were considered 
to be growth substrates. 

To assay photosynthetic growth on different nitrogen, 
phosphorus or sulfur sources (Biolog PM3B and PM4A), 
R. sphaeroides cells were grown aerobically for 5 days on 
a modified R2A agar [22,47] (0.25 g of yeast extract, 
0.25 g of Proteose Peptone, 0.25 g of Casamino Acids, 
0.12 g of K 2 HP0 4 , 0.025 g of MgS0 4 .7H 2 0, 0.5 g of 
sodium pyruvate and 15 g of agar per liter of water). 
Plates were set up as described for photosynthetic 
growth with the addition of 20 mM sodium succinate 
and 2 uM ferric citrate. Cell cultures were grown under 
constant illumination (10 W/m 2 ) at 30°C for 48 hrs, 
after which OD 650 readings were taken. Aerobic growth 
on the different nitrogen, phosphorus and sulfur sources 
is not reported due to significant background growth in 
the negative control wells, an issue that has been ob- 
served previously [25]. 

Due to comparatively slow growth rates of R. sphae- 
roides under anaerobic respiratory conditions, evaluation 
of these growth modes could not be reliably conducted 
with Biolog PM plates due to media evaporation. Thus, 
to assay anaerobic respiratory growth on different car- 
bon sources, 96 well microwell plates (Fisher Scientific) 
were set up to analyze 41 of the carbon sources identi- 
fied as R. sphaeroides growth substrates from PM (see 
Additional file 1: Table SI for a list of these substrates). 
The carbon sources were normalized for the total num- 
ber of carbon atoms in each compound (135.5 mM). 



R. sphaeroides cells were grown aerobically in SMM and 
centrifuged. Cells were then washed with SMM media 
lacking a carbon source (SMM no carbon), suspended in 
anaerobic SMM no carbon (which had been kept in an 
anaerobic chamber for at least 4 days) to an OD 600 
of -0.1 and DMSO was added to a final concentration 
of 0.9%. Then, 300 uL of a cell suspension was dispensed 
into wells containing a different carbon source in an an- 
aerobic chamber. Plates were incubated at 30 °C for 10 days 
with continuous shaking in a Tecan M200 plate reader lo- 
cated within the anaerobic chamber, with OD 650 readings 
taken every 6 minutes. Alternatively, plates were sealed in 
anaerobic bags, wrapped in foil and incubated at 30 °C for 
10 days with periodic shaking. 

Construction of mutants 

All R. sphaeroides mutants we constructed contained 
in-frame markerless deletions. Briefly, regions span- 
ning -1500 bp upstream and downstream of the target 
gene were amplified using sequence specific primers 
containing restriction sites for EcoRI, Xbal or Hindlll. 
These fragments were digested with the appropriate 
restriction enzymes and ligated into pK18mobsacB 
plasmid [48], digested with EcoRI and Hindlll, by 
three-way ligation to generate the various gene dele- 
tion constructs, which were confirmed by sequencing 
(Additional file 4: Table SI and S2). The pK18mobsacB- 
based plasmids were separately mobilized from E. coli 
S17-1 into R. sphaeroides strains. Cells in which the plas- 
mid had successfully integrated into the genome via hom- 
ologous recombination were identified by selection on 
SMM plates supplemented with kanamycin. These cells 
were then grown overnight in SMM without kanamycin. 
Cells that had lost the deletion plasmid (and thus the sacB 
gene) via a second recombination event were identified by 
growth on SMM plates supplemented with 10% sucrose. 
Individual gene deletions were confirmed by PCR and 
sequencing with specific primers. 

Ectopic expression plasmids were made by amplifying 
the target genes from the genome using sequence spe- 
cific primers containing restriction sites for Ndel and 
Bglll, Hindlll or BamHI. These DNA fragments were 
digested with the appropriate enzymes and cloned into 
pIND5 digested with the same enzymes. These plasmids 
were conjugated from E. coli S17-1 into the relevant 
R. sphaeroides mutant. Cells which harbor the desired 
plasmid were identified by selection on SMM plates 
supplemented with kanamycin. 

RNA extraction, qRT-PCR and microarray analyses 

RNA was isolated from exponential phase cultures of 
R. sphaeroides strains that were grown either photosyn- 
thetically in 16 mL screw cap tubes or aerobically in 
500 mL conical flasks. RNA isolation and subsequent 
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cDNA synthesis were performed as previously described 
[49]. qRT-PCR experiments were conducted in triplicate 
for each biological replicate using SYBR Green Jump- 
Start Taq ReadyMix (Sigma- Aldrich). Relative expression 
was determined via the 2- AAC T method with efficiency 
correction [50]. R. sphaeroides rpoZ was used as a 
housekeeping gene for normalization. Primers used in 
this analysis are provided in Additional file 4: Table S2. 

Constraint based analysis and model refinement 

Separate stoichiometric matrices, S m x n , were generated 
from the reconstructions (i.e., iRspl095 and iRspll40) 
with the rows (m) representing metabolites, the columns 
(n) representing reactions and entries in the matrices 
representing stoichiometric coefficients for metabolites 
involved in each reaction. Flux balance analysis (FBA) 
[51] was used to simulate in silico growth by solving the 
linear programming problem: 

maXVBiomass 

s.t 

S-v = 0 

v min <v<v max 

where v B i 0 mass is the flux through biomass objective 
function; v is the vector of steady state reaction fluxes; 
and v min and v max are the minimum and maximum 
allowable fluxes. The values in v min and v max were set 
to -100 and 100 mmol/g DW h for reversible reactions 
and 0 and 100 mmol/g DW h for forward only reactions. 
During simulation, all exchange reactions were assigned 
as being forward only (allowing metabolites to be se- 
creted into the medium but not taken up), except the 
exchange reactions for media components required for 
cell growth, which were set to measured values for limit- 
ing substrates such as ammonia, or allowed to be freely 
exchanged with the extracellular space, i.e., -100 < 
v < 100. The non-growth associated ATP maintenance 
limit was set to 8.39 mmol/gDW h [24]. Flux variability 
analysis [52] was also used to determine minimum and 
maximum allowable flux through reactions in the 
network. 

Initial simulations with iRspl095 in which the trans - 
hydrogenase reaction was deleted resulted in the predic- 
tion of optimal growth, suggesting alternative NADPH 
generating reactions existed in the metabolic network. 
Analysis of iRspl095 revealed that it includes at total of 
61 NADPH requiring reactions, of which only 29 were in- 
dependently non-essential and capable of functioning in 
the direction of NADPH synthesis. To identify all reac- 
tions within iRspl095 capable of producing NADPH to 
support growth, all 29 non-essential NADPH-requiring 
reactions within iRspl095, capable carrying flux the direc- 
tion of NADPH synthesis, were turned off. This resulted 



in a predicted growth rate of 0. Optimal growth was re- 
stored solely by turning on the transhydrogenase reaction, 
consistent with transhydrogenase being sufficient for ge- 
nerating NADPH required for growth. All other reactions 
capable of independently restoring growth, while the other 
NADPH-requiring reactions were still off, were consid- 
ered as a candidate NADPH producing reaction (Table 3). 

To assess the predicted NADPH demand during 
aerobic or photosynthetic growth across growth substrates 
(i.e., succinate, glucose and acetate), all predicted NADPH 
generating reactions (Table 3) set to have a zero flux, 
except the transhydrogenase reaction. Using a previously 
described mixed integer linear programming approach 
[53,54], 1000 alternative optimal FBA solutions were iden- 
tified under each condition. The flux through the trans- 
hydrogenase reaction, and thus the amount of NADPH 
predicted to be required, under each condition was ave- 
raged over the 1000 alternative optimal solutions and this 
average was used as an estimate of NADPH demand 
under those conditions. 

To predict fluxes through central metabolism, we 
used an extension of FBA called E-flux [55]. E-flux 
limits the maximum and minimum fluxes (v max and v min 
respectively) through the reactions in the network based 
on genome-wide gene expression measurements. To 
achieve this publicly available microarray data obtained 
from cells grown on succinate and glucose (GEO plat- 
form GPL162), as well as from cells grown acetate (this 
study), were normalized and used to constrain the fluxes 
through each reaction in the network as previously des- 
cribed [55]. For reactions without gene-to-protein-to- 
reaction (GPR) assignments, the fluxes through these 
reactions were allowed to have a v max of 100 mmol/g 
DW h and a v min of 0 or -100 mmol/g DW h, if the re- 
actions were forward only or reversible respectively. For 
reactions catalyzed by isozymes, the expression value of 
gene for the isozyme with the highest expression was 
used to constrain the reaction, while for multi-subunit 
enzymes the gene for the subunit with the lowest 
expression was used to constrain the reaction. After set- 
ting the upper and lower bounds, subsequent simula- 
tions were conducted with FBA as described above. 

The previously published genome-scale model for 
R. sphaeroides iRspl095 [11] was used as the starting 
point for a 2-step model refinement. In the first step, 
PM data was used to guide model refinement, which in- 
volved the manual addition and removal of reactions to 
bring it into better alignment with the PM data. PM data 
for carbon (C), nitrogen (N), phosphorus (N) and sulfur 
(S) utilization were compared to model predictions from 
FBA simulations in which equivalent compounds were 
provided as the sole sources of these nutrients. The 
uptake rate of the tested carbon source was set 
to -4 mmol/g DW h, while that of N, P or S sources 
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Additional file 2: Additional figures. File containing additional figures 
S1-S6. 

Additional file 3: Details of iRspl 140. Excel file containing tables with 
the details of the metabolic model iRspl 140. It contains eight tables 
(Tables S1-8). 

Additional file 4: Strains and Plasmids. An excel file with strains and 
plasmids used in this paper. It contains 2 tables (Tables S1 and S2). 

Additional file 5: iRspl 140 in SBML format. SBML format of iRspl 140 
for distribution and use in other modeling environments. 



was set to -1 mmol/g DW h, as these are in the range of 
uptake rates typically observed in R. sphaeroides [11]. 
Succinate was used as the C source when testing 
for N, P and S utilization, while ammonium, inorganic 
phosphate and inorganic sulfate served as N, P and S 
sources when accessing C utilization (consistent with the 
PM analysis). For these simulations, nutrients which 
resulted in a FBA predicted growth rate greater than 
zero were considered growth substrates. No Growth- 
Growth (NGG - no growth predicted by model but 
growth observed in PM) inconsistencies were manually 
rectified by addition of appropriate transport and/or en- 
zymatic reactions from the multi-organism databases 
KEGG [56] and BRENDA [57,58]. The required enzym- 
atic reaction(s) were added to the model based on one 
of the following 2 criteria: (i) the presence genes in the 
R. sphaeroides genome encoding the proteins potentially 
capable of catalyzing the new reaction(s) to be added; 
and (ii) if no putative enzymes were identified, the meta- 
bolic route that required the addition of the fewest reac- 
tions to iRspl095 was selected. Growth-No Growth 
(GNG - growth predicted by model but no growth ob- 
served in PM) inconsistencies in iRspl095 were resolved 
by removal of transport reactions for the substrate in 
question, when this did not introduce and new incon- 
sistencies with the PM data. 

In a second step of model refinement, putative en- 
zymes capable of catalyzing reactions added to 
iRspl095 where identified by BLAST searches using 
the protein sequences from other organisms previ- 
ously verified to carry out the reaction in question. 
A BLAST E-value cutoff of 10e- 20 was selected as a 
threshold for significance. Enzymatic functions not 
previously included in iRspl095 and which were 
encoded by genes without any previously defined 
specific function were considered as newly annotated 
genes, while those with previously defined putative 
functions were considered as having additional func- 
tionality (Additional file 3: Table S4). Updated infor- 
mation from KEGG [56] database, new information 
obtained from mutant analysis in this study, and 
data from recent literature searches were incorpo- 
rated to generate iRspl 140. iRspl 140 in SBML for- 
mat is provided in Additional file 5 and can be 
accessed in the BioModels database with ID 
MODEL1304240000. 



Additional files 



Additional file 1: Phenotype microarray data. This is an excel file 
containing results from Biolog phenotype microarray analysis of R. 
sphaeroides wild-type and mutant cells across a variety of growth 
conditions and nutrient sources. It contains eight tables (Tables S1-8). 
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